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Infinitely generated semigroups and polynomial complexity 

J.C. Birget 


Abstract 

This paper continues the functional approach to the P-versus-NP problem, begun in [5]. Here 
we focus on the monoid TZM^ of right-ideal morphisms of the free monoid, that have polynomial 
input balance and polynomial time-complexity. We construct a machine model for the functions 
in TZM 2 , and evaluation functions. We prove that TZM^ is not finitely generated, and use this to 
show separation results for time-complexity. 


1 Introduction 

In [2] we defined the monoids of partial functions fP and TZAi^- The question whether P = NP is 
equivalent to the question whether these monoids are regular. The monoid fP consists of all partial 
functions A* —)■ A* that are computable by deterministic Turing machines in polynomial time, and that 
have polynomial I/O-balance. The submonoid TZAi^ consists of the elements of fP that are right-ideal 
morphisms of A*. One-way functions (according to worst-case time-complexity) are exactly the non¬ 
regular elements of fP. It is known that one-way functions (according to worst-case time-complexity) 
exist iff P / NP. Also, / G TZAi^ is regular in fP iff / is regular in TZAi^- Hence, P = NP iff fP is 
regular, iff TZAi^ is regular. We refer to [HI [l5] for background on P and NP. 

The original motivation for studying TZAi^ in addition to fP was that TZAi^ is reminiscent of the 
Thompson-Higman groups mi EH ESI min |5] and the Thompson-Higman monoids [3]. It also quickly 
turned out that TZAA^, while having the same connection to P-vs.-NP as fP, has different properties 
than fP (e.g., regarding the Green relations, and actions on {0,1}'^; see [21 [3]). It is hard to know 
whether this approach will contribute to a solution of the P-vs.-NP problem, but the monoids fP and 
TZA42 3-re interesting by themselves. 

Above and in the rest of the paper we use the following notation and terminology. We have an 
alphabet A, which will be {0,1} unless the contrary is explicitly stated, and A* denotes the set of all 
strings over A, including the empty string e. For x G A*, |x| denotes the length of the string x. For 
a partial function f : A* ^ A*, the domain is Dom(/) = {x G A* : f{x) is defined}, and the image 
is lrri(/) = f{A*) = /(Dom(/)). When we say “function”, we mean partial function (except when 
we explicitly say “total function”). Similarly, for a deterministic input-output Turing machine with 
input-output alphabet A, the domain of the machine is the set of input words for which the machine 
produces an output; and the set of output words is the image of the machine. 

A function f : A* ^ A* is called polynomially balanced iff there exists polynomials p, q such that 
for all X G Dom(/): |/(x)| < p(|x|) and |x| < q{\f{x)\). The polynomial q is called an input balance 
function for /. 

As we said already, fP is the set of partial functions f : A* ^ A* that are polynomially balanced, 
and such that x G Dom(/) i— f{x) is computable by a deterministic polynomial-time Turing machine. 
Hence, Dom(/) is in P when / G fP, and it is not hard to show that lm(/) is in NP. Clearly, fP is a 
monoid under function composition. 

A function f : A* ^ A* is said to be one-way (with respect to worst-case complexity) iff / G fP, but 
there exists no deterministic polynomial-time algorithm which, on every input y G lm(/), outputs some 
X £ A* such that /(x) = y. By “one-way” we will always mean one-way with respect to worst-case 
complexity; hence, these functions are not “cryptographic one-way functions” (in the sense of, e.g.. 
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[g la no]). However, they are important for the P-vs.-NP problem because of the following folklore 
fact (see e.g., [11] p. 33): One-way functions exist iff P ^ NP. 

As is easy to prove (see the Introduction of m), / G fP is not one-way iff / is regular in fP. By 
definition, an element / in a monoid M is regular iff there exists f ^ M such that f f f = /; in this 
case, f is called an inverse of /0 A monoid M is called regular iff all the elements of M are regular. 
In summary we have: The monoid fP is regular iffP = NP. 

Let us look in more detail at the monoid TZAd^- ^ right ideal of A* is a subset R f A* such that 
RA* = R (i.e., R is closed under right-concatenation by any string). For two strings v,w G A*, we 
say that u is a prefix of w iff (fix G A*)[vx = w]. A prefix code in A* is a set P C A* such that no 
word in P is a prefix of another word in P. For any right ideal R there exists a unique prefix code Pjj 
such that R = Pj^ A*; we say that generates P as a right ideal. For details, see e.g. [gil]; a good 
reference on prefix codes, and variable-length codes in general is [T]. 

A right-ideal morphism is a partial function h : A* ^ A* such that for all x G Dom(/i) and 
all w G A*: h{xw) = h{x)w. In that case, Dom(/i) and lm(/i) are right ideals. For a right-ideal 
morphism h, let domC(/i) (called the domain code) be the prefix code that generates Dom(/i) as a 
right ideal. Similarly, let imC(/i), called the image code, be the prefix code that generates lm(/i). So a 
right-ideal morphism h is determined by h\domC(h) (the restriction of h to its domain code). In general, 
imC(/i) C /i(domC(/i)), and it can happen that imC(/i) /i(domC(/i)). We define 

TZM.^ = {/ G fP : / is a right-ideal morphism of A*}. 

By Prop. 2.6 in [2], / G TZM.^ is regular in TZAi^ iff / is regular in fP. The monoid TZM.^ is regular 
iffP = nP. 

We saw (Cor. 2.9 in [2]) that fP and TZAi^ are not isomorphic, that the group of units of TZAi^ is 
trivial (Prop. 2.12 in |2]), and that TZAi^ has only one non-0 j7"-class (Prop. 2.7 in [2]). In [3] we will 
see that TZAi^ has interesting actions on {0,1}^^, and has interesting homomorphic images (some of 
which are regular monoids, and some of which are regular iff P = NP). Overall, TZAi^ seems to have 
“more structure” than fP. 

It is proved in [2] (Section 3) that fP is isomorphic to a submonoid of TZAi^- To prove this, we use 
an encoding of the three-letter alphabet {0,1, ff} into words over the two-letter alphabet {0,1}; this 
encoding will also be used here. First, we encode the alphabet {0,1, ff} by code(O) = 00, code(l) = 01, 
code(#) = 11. A word xi .. .Xn G {0,1, #}* is encoded to code(xi) ... code(x„). For a fixed A; > 0, 
a fe-tuple of words {ui ,..., Uk-i,Uk) G {0,1}* x ... x {0,1}* is encoded to code(ui # ... Uk-i #) Uk 
= code(Mi) 11 ... code(ufc_i) 11 Uk G {0,1}*. A function / G fP is encoded to G TZAd^, defined 
by domC(/‘^) = code(Dom(/) ff), so Dom(/‘^) = code(Dom(/)) 11 {0,1}*; and 
/'^(code(a: #) u) = code(/(x) #) u, 

for all X G Dom(/) and v G {0,1}*; equivalently, f^{code{x) 11 v) = code(/(x)) 11 v. Then for every 
L C {0,1}*, code{Lff) is a prefix code, which belongs to P iff L is in P. And / G fP iff G TZAi^- 
The transformation / i-t- is a isomorphic embedding of fP into TZAi^', moreover, is regular in 
TZAi 2 iff / is regular in fP. From here on, the alphabet denoted by A will always be {0,1}. 

In [2] (Section 4) we introduced a notion of polynomial program for Turing machines with built-in 
polynomial counter (for input balance and time-complexity). These programs form a machine model 
that characterizes the functions in fP. For a polynomial program w, we let fw € fP denote the function 
computed by this program. For every polynomial q of the form q{n) = an^ -\-a (where a, k are positive 
integers), we constructed an evaluation map ev^ G fP such that for every polynomial program w with 
built-in polynomial Pu,(n) < q{n) (for all n > 0), and all x G A*, 
ev^(code(t(;) 11 x) = code{w) 11 4>w{x) 

^The terminology varies, depending on the field. In semigroup theory /' such that ff'f = / is called a semi-inverse 
or a pseudo-inverse of /, in numerical mathematics /' is called a generalized inverse, in ring theory and in category 
theory it’s called a weak inverse. In semigroup theory the term “inverse” of / is only applied to /' if f ff = f' holds 
in addition to ff'f = /. It is easy to see that if ff'f = / then f'ff' (= v) satisfies fvf = / and vfv = v. 
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if X G Dom(i;/)^); if x 0 Dom((/)i„) then ev^(code(t(;) 11 x) is undefined. We used ev^, with any 
polynomial q of degree > 2 with large enough coefficient, to prove the following: First, fP is finitely 
generated (Theorem 4.5 in [2]). Second, ev^ is complete in fP with respect to inversive polynomial 
reduction (Section 5 of [2]). Later in this paper (Def. 12.41 and following) we define completeness and 
various reductions for along the same lines as for fP. 

Note that fP and ia their entirety, do not have evaluation maps that belong to fP, respec¬ 
tively (since such maps would not have polynomially bounded complexity). That is the reason 

why we restrict ev and evR to complexity < g(.), and why we need precise machine models for fP and 
TZM.^ (as opposed to more intuitive “higher-level” models). 

In Section 2 we define a machine model that characterizes the functions in and for any large 

enough polynomial q we construct evaluation maps evR^ and evR^*^ for the functions in that 

have balance and time-complexity < q. We prove that evR^^ is complete in TZAi^ (and in fP) with 
respect to inversive Turing reduction. In Section 3 we prove that TZAi^ is not finitely generated, and in 
Section 4 we show that infinite generation has some complexity consequences, i.e., inhnite generation 
can be used for a time-complexity lower-bound argument. 

2 Machine model and evaluation maps for IZAi^ 

The evaluation map ev^ : code(rc) 11 x i —> code(rc) 11 (f)yj{x), that we constructed for fP in [2], works 
in particular when G TZAi^ (provided that has time-complexity and input-balance < q). But 
ev^ is not a right-ideal morphism and, moreover, ev^ can evaluate functions that are not in TZAi^- We 
want to construct an evaluation map that belongs to TZAi^, and that evaluates exactly the elements of 
TZAi 2 that have balance and complexity < q. In [2] we constructed a machine model for fP, namely a 
class of Turing machines with built-in polynomial counter (for controlling the time-complexity and the 
input-balance). We will rehne these Turing machines in order to obtain a machine model for accepting 
the right ideals in P, and for computing the functions in TZAi^- 

We will consider deterministic multi-tape Turing machines with input-output alphabet A, with 
a read-only input tape, and a write-only output tape. Moreover we assume that on the input tape 
and on the output tape, the head can only move to the right, or stay in place (but cannot move 
left). We assume that the input tape has a left endmarker and a right endmarker B (the blank 
symbol). At the beginning of a computation of such a machine M on input z G A*, the input tape 
has content #zB, with the input tape head on # ; initially, all other tapes are blank (i.e., they are 
filled with infinitely many copies of the letter B). The output tape does not need endmarkers (since 
it is write-only). We assume that M has a special output state gout) and that M only goes to state 
gout when the output is complete; the output state is a halting state (i.e., M has no transition from 
state gout)- An important convention for a Turing machine M with non-total input-output function 
Jm is the following: If M on input x halts in a state that is not gout? then there is no output (even if 
the output tape contains a non-blank word). So, in that case, fM{x) is undefined. The content of the 
output tape is considered unreadable, or hidden, until the output state gout is reached. 

This kind of Turing machine can compute any partial recursive function (the restrictions on the 
input and output tapes do not limit the machine, because of the work-tapes). To compute a function 
in fP, we add a built-in polynomial (used as a bound on input balance and time-complexity); see 
Section 4 in [2]. 

In order to obtain a machine model for the functions in IZAi^ the above Turing machines (with 
built-in polynomial) will be restricted so that they compute right-ideal morphisms of A*. This is done 
in two steps: First, sequential functions and sequential Turing machines are introduced. From this it 
is easy to obtain a class of Turing machines that compute right-ideal morphisms (which are a special 
kind of sequential functions). Recall that by “function” we mean partial function. By definition, a 
function / : A* —>• A* is sequential iff 
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for all xi,X 2 E Dom(/); if xi is a prefix of X 2 then f{xi) is a prefix of f{x 2 )- 
Obviously, every right-ideal morphism is a sequential function. 

A sequential Turing maehine is a deterministic multi-tape Turing machine M (with special input 
tape and special output tape and output state, according to the conventions above), with input-output 
function /m, such that the following holds. 

For every x € Dom{fM) and every word z G A* : in the computation of M on input xz, 
the input-tape head does not start reading z B until fnix) has been written on the output tape. 
To “read a letter T’ (in zB) means to make a transition whose input letter is i. So, the input tape 
has content ffxzB, with the input-tape head on the left-most letter of zB (but no transition has been 
made on that letter yet), and the output tape now has content fM{x). Of course, at this moment the 
computation of M on input xz is not necessarily finished; the state is not necessarily gout) the output 
might still grow, and gout might be reached eventually, or not; if gout is never reached, there is no final 
output. 

The sequential Turing machines form a machine model for the partial recursive sequential functions. 
If we let the machines have a built-in polynomial we obtain a machine model for the sequential functions 
in fP. 

Finally, to obtain a machine model for the functions in we take the sequential Turing 

machines with built-in polynomial, with the following additional condition. 

For every x G Dom(/M) and every word z G A* : in the computation of M on input xz, 
once fM{x) has been written on the output tape (after x was read on the input tape), 
the remaining input z is copied to the output tape; at this point the state gout is reached. 

We call such a machine an TZAi^''^^'^hine. 

The following shows how, from an fP-machine for a function /, an TZAi^-m.a.eh.uye for / can be 
constructed, provided that / G TZM.^. 

Let us first consider right ideals in P, rather than functions. For any polynomial program w for 
a Turing machine that accepts a language L G P, we construct a new polynomial program v 
describing a Turing machine M,, that behaves as follows: On any input x G {0,1}*, My successively 
examines prefixes of x until it finds a prefix, say p, that is accepted by M^; My does not read the letter 
of x that comes after p until it has decided that p ^ L. As soon as My finds a prehx p oi x such that 
p G L, My accepts the whole input x. If accepts no prefix of x, My rejects x. Thus, My accepts 
LA* (the right ideal generated by L); if L is a right ideal then LA* = L. If My, has time-complexity 
< r(.) (a polynomial) then My has time-complexity < T{.)‘^. 

Let us now consider functions in TZAi^. Given any polynomial program w for a function cfy, G fP, 
we construct a new polynomial program v such that My, on input x, successively examines all prefixes 
of x until it finds a prefix p in Dom((/)^); let 4>w{p) = V- Then, on input x, the machine My outputs y z, 
where z is such that x = pz. Note that since p is the shortest prehx of x such that p G Dom((/)i„), we 
actually have p G domC(0^) (if Dom((^^) is a right ideal). The machine My does not read the letter of x 
that comes after a prehx p until it has decided that p 0 Dom((/>i„) or p G domC((/>i„). Hence, the function 
computed by My is in TZAi^- This construction describes a transformation / G fP i—)• /pref G TZAi^, 
where /pref is dehned as follows: 

/pref(a:) = f{p) z, 

where x = pz, and p is the shortest prehx of x that belongs to Dom(/); so, p G domC(/pref). Thus for 
every / G fP we have: / G TZAt^ iff /pref = /• 

Based on T^AI^-ma'Chines we can construct evaluation maps for TZAi^- Let g be a polynomial 
where g(n) = an^ a for some integers a, fc > 1. We dehne evR^, as follows: 

evR^ (code(t(;) 11 x) = code(tc) 11 4>w{x), 

for all TZAi^-pTogiam.s w with built-in polynomial pw < q, and for all x G Dom((;/!),r))- The details of 
the construction are the same as for ev^; see Section 4 in [2]. Although evR^ belongs to TZAi^ and 
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evaluates all TZfA^-^Yogvam.s w with built-in polynomial < q, we will prove in Theorem 14.31 that the 
complexity of evR^ is higher than q. 

The following doubly coded evaluation function is usually more useful for 7 ?.A^ 2 "P'^ograms. It is 
defined by 

evR^*" (code(tt;) 11 code(u) 11 v) = code(t(;) 11 code((/>^(u)) 11 v, 
when u G domC(</>^), v G A*, and w is as before. 

To give a relation between evR^ and evR^*^ we will use the following partial recursive right-ideal 
morphism 7 , defined for very 7?.A1 2 -program w G A* and every x G Dom((/)^) by 

7 (code(r(;) 11 x) = code(t(;) 11 code(u) 11 u, 

where x = uv, and u is the shortest prefix of x such that u G Dom((/)i„); equivalently, u G domC((/)i„). 
When x 0 Dom((/)i„), 7 (code(r(;) 11 x) is undefined. Essentially, 7 finds the shortest prefix of x that 
belongs to Dom((;/!)^„) (or equivalently, to domC((?!)iu)). The function 7 can be evaluated by examining 
successively longer prefixes of x until a prefix u G Dom((;/!)^„) is fund. So 7 is computable with recursive 
domain, when w ranges over 7 ?.A^ 2 -programs. 

For any fixed 2 -program tc, let 7 ^ be 7 restricted to this tc, i.e., 7 ^ = 7|code(ij)) 11 A*- Iri other 
words, Dom( 7 ^) = code(tc) 11 Dom((;/!)^„), and 

7 i„(code(t(;) 11 uu) = code(r(;) 11 code(u) 11 u 

when u G domC(0^), v G A*. Similarly we define by Dom( 7 °) = Dom(ij!)^„) (as opposed to 
code{w) 11 Dom((/).u,)), and 

7 °(tiu) = code(r(;) 11 code(n) 11 u 

when u G domC((/>^), v G A*. So, lm( 7 ") = lm( 7 ^) = code(ui) 11 domC((?!)^) 11^*. 

Then 7 ^ and 7 ° belong to TZAi^ for every fixed w. But 7 itself is not polynomial-time computable, 
since it has to work for all possible T^Al^-programs w. 

Another restricted form of 7 that belongs to TZAi^ is obtained by choosing a fixed polynomial q, 
and dehning 7 ^ as the restriction of 7 to the set 

{code(t(;) 11 x : tc is a T^Ad^-program with built-in polynomial < q, and x G Dom((^^u)}. 
Hence, jq G TZAi^. 

We also define the functions vro, vri, po; Pi £ TZAi^ by 7ra(x) = ax, Pa{ax) = x, for all x G {0,1}* 
and a G {0,1}. For a word w = a^ ... ai with a* G {0,1} we denote 'Ka„ o ... o vr^j by tt^, and 
Pa„ o ••• opai by p^. 

Then we have: 7 " = 7 ^ o 7rcode(«;) 11, and 7 ^ = 7 ° o Pcode(u,) 11 • 

Another important function in TZAi^ is the decoding function, defined for any u,v G A* by 
decode(code(u) 11 u) = uv, 

so domC(decode) = {00, 01}* 11, and imC(decode) = {ej. We also define a second-coordinate decoding 
function, for all ui,U 2 ,v G A*, by 

decode 2 (code(ui) 11 code(u 2 ) 11 v) = code(ui) 11 U 2 v. 

So, decode 2 G TZAi^, domC(decode 2 ) = {00,01}* 11 {00,01}* 11, and imC(decode 2 ) = {00,01}* 11. 

Now we can formulate a relation between evR^ and evR^^; 

evR^ = decode 2 o evR^'" o 7 ^. 

In order to show that evR^*^ is complete with respect to inversive reduction in TZAi^: we will 
adapt the padding and unpadding functions (defined for fP in [2], Section 4) to TZAi^- Although for 
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7^A^2 keep the same names as for the corresponding (un)padding functions in fP, the functions 
are slightly different. The padding procedure begins with the function expand(.), defined by 

expand (code(t(;) 11 code(u) 11 v) 

= code{ex{w)) 11 o4|“de(«)|2+8|code(«)|+2 code(u) 11 V, 

for all u € domC((?!)^), v G A*, and TZAi^-Y>^ogxam.s w. The word kode(«)P+8|code(u )|+2 jg q£ 
form code(s) for a word s G 0*1; and l“de(«)P+8 |code(u )|+2 g^ (-ode(n) is also a code word, namely 
code(su). Since 0*1 and its subset (00)* 01 are prefix codes, code(s) = l“deHP+8|code(n )|+2 g^ 
uniquely determined as a prefix of code(stt). 

Here, ex{w) is an 7?.Ad 2 -program obtained from w so that 

(/>ex(«,)((00)^ 01 code(rt) 11 u) = (00)^ 01 code{(f)wiu)) 11 v, 

for all u G domC((?!>^), u G A*, and h > 0. Moreover, if n 1 —)• a+ a is the built-in polynomial of the 
program w then the built-in polynomial of ex{w) is 

Pe{n) = + tte, with Oe = max{ 12 , [ 0 / 2 ^] -|- 1 }. 

The detailed justification of the numbers used in the definition of expand and ex (as well as reexpand, 
recontr, and contr below) is given in [2], Section 4. 

It is important that expand uses the prefix w of x for padding (in the format code(u) 11, where 
u G domC((/>i^)). If the whole input x were used for computing the amount of padding, expand would 
not be a right-ideal morphism. This is the reason why we introduce or 7 )),, in order to isolate the 
prefix u G domC((/)tu) of x. 

We iterate expansion (padding) by applying the following function, where ex(.) is as above: 
reexpand(code(ex( 2 ;)) 11 0^ 01 code(u) 11 v) 

= code(ex( 2 )) 11 01 code(rt) 11 v, 

where k > 0, u,v ^ A*, and z is any T^Ad 2 -program; k is even in the context where reexpand will be 
used. 

Repeated contraction (unpadding) is carried out by applying the following function, for k > 0: 

recontr(code(ex(z)) 11 (00)*^ 01 code(y) 11 u) 

= code(ex( 2 )) 11 ( 00 )”'"^^^ L^/fc/2J-l} g^ code( 2 /) 11 v; 

note that max{l, [■\/k/2\ — 1 } > 1 . 

The unpadding procedure ends with the application of the function 

contr(code(ex( 2 ;)) 11 (00)^ 01 code(y) 11 v) = code( 2 ;) 11 code{y) 11 v, 

if 2 < |(00)^| = 2k < 4 |code(r/)p-b 8 |code(y)|-|-2. 

The functions expand(.), ex(.), reexpand(.), recontr(.), and contr(.), are undefined in the cases where 
no output has been specified above. 

Lemma 2.1 Let q 2 be the polynomial defined by q 2 {n) = 12-|- 12. For any fiw G T^Ad^, where w 
is a TZM .2 -program with built-in polynomial q (of the form q{n) = a -\- a for positive integers a, k), 
we have for all u G domC((/>^), v £ A* : 

(★) (pwiuv) 

= decode o Pcode(w) 11 o contr o recontr^'" o evR^*" o reexpand™ o expand o j°(uv) 

= Pcode(w) 11 ° decode 2 o contr o recontr^*" o evR^*^ o reexpand"* o expand o 7" (nu), 
where m = |'log 2 (a -|- /c)]. 
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Proof. This is similar to the proof of Prop. 4.5 in [2], with a few modifications. For u G domC((/>^) 
V G A*, 


uv 1 —^ code(ui) 11 code(ri) 11 v 

code(ex(u;)) 11 l“deH|2+8 |codeH |+2 q;l code(ri) 11 v 


reex^d code(ex(ti;)) 11 0^"‘+i 01 code(M) 11 v, 


where Ni = 4|code(M)p + 8|code(tt)| + 2, so 01| = (2 (|code(ri)| + 1))^; by induction, Ni = 
^^i-i + 8 A^j-i + 2 for 1 < z < 2m + 1, and 01| = (2 + 1))^. The above string, which will 

now be the argument of has length > N^+i + 2 + |code(tt)|, which is much larger than the time 

it takes to simulate the machine with program w on input u. So can now be applied correctly. 

Continuing the calculation. 



code(ex(t(;)) 11 01 code{cj)w{u)) 11 v 


recontr^*" 


code(zc) 11 00 01 code((^^(u)) 11 v. 


We use 2m in recontr^™ because (j)w{u) could be much shorter than u] but because of polynomial input 
balance, 1^1 < Piu(|<?!)ij)(n)|). Note that doing more input padding than necessary does not do any harm; 
and recontracting (unpadding) more than needed has no effect (by the definition of recontr). Hence 
contr can now be applied correctly. We complete the calculation: 


code(t(;) 11 code(0i„(u)) 11 v 


decode2 


code(z(;) 11 4>w{u) v 


Pcode{w) 11 




V. 


□ 


Lemma 2.2. TZAi^ following infinite generating set.' 

{decode, po, pi, ttq, tti, contr, recontr, evR^*", reexpand, expand} 

U { 7 i„ : w is any TZM.^-program]. 

Here, decode can he replaced by decode 2 . Yet another infinite generating set ofTZM.^ is 

{Pq, Pi,ttq,tti} U (evR^ : q is any polynomial of the form q{n) = a+ a with a, k . 

Proof. The first infinite generating set follows from Lemma [2Tl Recall that Jw = Jw ° 7rcode(io) ii- 
The second generating set follows in a straightforward way from the proof of Prop. 4.5 in [2]. □ 


Proposition 2.3 TZAi^ is generated by a set o/regular elements ofTZAi^- 


Proof. The generators po, pi,ttq,tti are easily seen to be regular. Thus, using the second infinite 
generating set in Lemma 12.21 it is enough to factor evR^ into regular elements. We have: 

evR^ = p2,g o Eg, 

where Eg and p 2 ,g are defined as follows: For every T^.Wf^’Pi'ogram w with built-in polynomial < q, 
and every u G domC(0i„) and v G A*, 

Eg(^code{w) 11 code(u) 11 v) = code{w) 11 code(u) 11 code(0ui('w)) H v, 
and for all z,y,x,v G A* such that |x| < q{\y\), 

P 2 ,q{code{z) 11 code(x) 11 code{y) 11 v) = code(z) 11 code(y) 11 v. 

The functions are undefined otherwise. It is easy to see that Eg and p 2 ^q have polynomial-time inversion 
algorithms (i.e., they are regular), and belong to TZAi^- n 

We will show now that evR^^ is complete in TZAi^ and in fP, with respect to a certain “inversive 
reduction”. We need to recall some definitions from [2] concerning reductions between functions in fP 
or TZAi 2 , and in particular, reductions that “preserve one-wayness” (inversive reductions). 


7 


Definition 2.4 Let /i ,/2 : A* ^ A* be two polynomially balanced right-ideal morphisms. 

(1) We say that f 2 simulates /i (denoted by fi = 4 / 2 ) iff there exist a, /3 E TZM.^ such that fi = fdof 2 oa. 

(2) We have a polynomial-time Turing simulation of fi by f 2 (denoted by fi f 2 ) iff fi can be 
computed by an oracle TZAi^-machine that can make oracle calls to / 2 ; such oracle calls can, in 
particular, be calls on the membership problem o/Dom(/ 2 ). 

In the above definition, /i ,/2 need not be polynomial-time computable. 

Since TIM .2 is J"°-simple (Prop. 2.7 in [2]), every /i E TZAi^ is simulated by every /2 E TZA4.2—{ff\ 
(for each of the above simulations). 

Definition 2.5 (Inversive reduction). If =4x is a simulation between right-ideal morphisms (e.g., 
as in the previous definition) then the corresponding inversive reduction is defined as follows. We say 
that fi inversively X-reduces to f 2 (denoted by ff ^inv,x f 2 ) iff 

(1) A f 2 , and 

(2) for every inverse A of A there exists an inverse f[ of A such that f[ fff here, A and f[ 
range over all polynomially balanced right-ideal morphisms A* ^ A*. 

Note that 77°-simplicity (Prop. 2.7 in [2]) does not apply for inversive reduction since A; A range 

over just TZAA^- One easily proves the following about polynomially balanced right-ideal morphisms 
A) A (see [2] , Section 5): 

ff A ^inv,T A and A £ then A E IZAi^; if, in addition, A is regular, then A is regular 

(equivalently, if, in addition, A is one-way, then A is one-way). 

Definition 2.6 A polynomially balanced right-ideal morphism A is complete in a set S (of right-ideal 
morphisms) with respect to an (inversive) reduction ^inv,x iff fo G S, and for all fi G S: <f> ^inv.x A- 

See Section 5 of [2] for more details and properties of these simulations and reductions; in [2] the 
focus was on fP, whereas here we concentrate on IZAA^. The simulations in Def. 12.41 are similar to the 
standard notions of reductions between decision problems. The concept of inversive reduction was first 
introduced in [2]; it is the appropriate notion of reduction between functions when one-wayness is to 
be preserved under upward reduction (and regularity is to be preserved under downward reduction). 

In the above definitions we only refer to polynomially balanced inverses; this is justified by the 
following Proposition, according to which “balanced functions have balanced inverses”. 

Proposition 2.7 Suppose f is a right-ideal morphism with balance < q[.) (where q{.) is a polyno¬ 
mial), and f has an inverse f[ with time-complexity < T(.). Then f has an inverse f with balance 
< q and time-complexity < T(.) -|-€(?(.) (for some constant c> 1). The inverse f can be chosen as 
a restriction of f[. 

Proof. Let f be the restriction of f[ to the set 

{yEDom(A): \y\ < q{\f{iy)\) and \f{{y)\ < q{\y\)}. 

Then f obviously has balance < q. Note that since f[ is an inverse of / we have lm(/) C Dom(A)- 
To show that f is an inverse of / it is sufficient to check that the domain of f contains lm(/). Let 
y = f{x) E lm(/) for some x E Dom(/). Then f{f{{y)) = y, since /( is an inverse. 

Checking \y\ < q{\f{{y)\). \y\ = |/(A(y))l — Q{\f{{y)\'i inequality holds since g is a balance for 
/ on input f{{y). 

Checking \f{{y)\ < q{\y\): \f[{y)\ < q{\f{f[{y))\) since g is a balance for / on input f[{y); and 

qi\fif[iy))\) = qi\y\) since fiffiy)) = y. 

To find a time-complexity bound for /', we first compute f{{y) in time < T(|y|); thereby we also 
verify that y E Dom(A)- To check whether y is in the domain of f we first compare |y| and \ fi{y)\ in 
time < |y| -|- 1. 


Checking \y\ < q{\f[{y)\): If |y| < \f[{y)\ then we automatically have |y| < q{\f[{y)\). If |y| > 
\f{{y)\ we compute q{\f[{y)\) in time 0{q{\f[{y)\)) (< 0 (g(|?/|))), by writing the number \f[{y)\ in 
binary, and then evaluating q (see Section 4 of [2] for a similar computation). Then we check \y\ < 
q{\f{{y)\) in time < \y\ + 1. Checking \f{{y)\ < q{\y\) is done in a similar way, in time < 0{q{\y\)) + 

li/l + l- □ 

Theorem 2.8 The map evR^^ is complete for TZAi^ with respect to inversive Turing reduction. 

Proof. Lemma l2.II provides the following simulation of by evR^^: 

4>w = decode o /?code(«)') ii o contr o recontr^™ o evR^^ o reexpand™ o expand o 7 ^, . 

To obtain an inversive Turing simulation, let e' be any inverse of evR^^. Slightly modifying the proof 
of Prop. 5.6 in [2], we apply e' to any string of the form 

code(ex(r(;)) 11 0 ^"*+! 11 code(p) 11 z, 

where p E (/)i„(domC(0t„)), and z E A*; then for any p E imC(0ui) (^^(domC((^^))), and z ^ A*: 
e'(code(ex(t(;)) 11 0 ^”*+^ 11 code(p) 11 z) 

= code{ex{w)) 11 0 ^”*+^ 11 code(t) 11 z, 

for some t E (p~^{p) C Dom((/)i„). Based on e' we now construct an inverse of (pw such that cp'^ e'; 
for any y E lm((/)i„) we define 

(pwiu) = decode o Pcode{w') 11 ° contr o recontr^™ o e' o reexpand'" o expand o 6°(y). 

Here, (5° (y) is defined by 

^w(y) = code(r(;) 11 code(p) 11 z, 

when y = pz with p E imC((/)i„), z E A*. So, is similar to except that uses imC((/>i„), 

whereas 7 )(,(.) uses domC((/)i^). We saw that E but unless P = NP, <5° will not be in TZAi^ 

in general. 

The value can be computed by an T^A^^-machine M that makes oracle calls to Dom(e') 

and to e' as follows. On input y, M considers all prefixes of y of increasing lengths, pi,...,pk, 
until pj E lm((/)i„) is found. Since pj is the first prefix in lm((/>i„), we have pj E imC((/>i^) and 
(5°(y) = code(r(;) 11 code(pj) 11 z. To test for each pi whether pi E lm((^^), M pads pi to produce 
code(yi); if pi E lm((/)^) then e'(code(ex(r(;)) 11 •) is defined on input 0^""+' 11 code(pi). 
Thus, if code(ex(r(;)) 11 0^”*+' 11 code(pj)) ^ Dom(e'), then pi ^ lm((/i^). On the other hand, if 
code(ex(u;)) 11 0^”*+' 11 code(pj)) E Dom(e'), then let ti E 4>w^{Pi) be such that 
e'(code(ex(r(;)) 110'^"'+' 11 code(pj)) = code(ex(t(;)) 11 0'^"'+' 11 code(L). 

One oracle call to &' yields this, and hence L. Then we can use 4>w to check whether ti E Dom((/)i„); 
and this holds iff pi E lm(0i„). This way, M can check whether pi E lm((/)^). Thus, if y E lm((/)^), M 
will find pj E lm((/ii„). When y ^ lm(0^), M produces no output; this doesn’t matter since we do not 
care how 0 '^ is defined outside of lm((/)^). 

Once df^{y) is known, the remaining simulation 

decode o Pcode(io') 11 ° contr o recontr^™ o e' o reexpand™ o expand 
of e', applied to (I(),(y) = code(u;) llcode(p) 11 2 , yields (pwiv)- 

The function (p'.^ is an inverse of <pw. Indeed, for x E Dom((/).u,), we have (pw{x) = pz for some 
p E imC(0t„), 2 : E A*. Then 

reexpand™ o expand o 5)),(pz) = code(ex(tc)) 11 0'^"*+' 11 code(p) 11 z; 
and applying e' then yields 

code(ex(Tc)) 11 0'^"'+' 11 code(t) 11 z. 
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for some t E cj)~^{p). Applying 

decode o Pcode{w') ii ° contr o recontr^™ 

now yields tz. Finally, = pz, since t E cj)~^{p). So, = (j)w(l)'^{pz) = = pz = 

4 >w{x). □ 

We show next that evR^*^ is not only complete for TZAi^^ but for all of fP. 

Proposition 2.9 The map evR^^ f'E TZAi^ J %s coTTiplete foT fP with vespect to ^inv,T * 

Proof. By Prop. 5.6 in [2], ev^ is complete in fP for inversive simulation. By Prop. 5.17 in [2], 
ev^ ^inv (ev^)*^. Moreover, (ev^)*^ ^inv,T evR^*^; indeed, (ev^)*^ E TZAi^ (since f maps into 

TZAi^), and we just saw that evR^^ is complete in TZAi^- Hence ev^ ^inv ^inv,T SvR^^*^. □ 

3 Non-finite generation 

In [2] we proved that fP is finitely generated, and we left open the question whether TZAi^ is also 
finitely generated. We will now answer this question negatively. We will use the following general 
compactness property: If a semigroup S is hnitely generated, and if P is any infinite generating set of 
S, then S is generated by some hnite subset of this set P. 

Theorem 3.1. TZAi^ is not finitely generated. 

Proof. We saw that TZAi^ is generated by the infinite set 

{poj Pi, ^ 0 , tti, decode 2 , contr, recontr, evR^'", reexpand, expand} 

U { 7 ^ : m is an TT-Al^-programj. 

Let us assume, by contradiction, that TZAi^ is hnitely generated. Then a hnite generating set can be 
extracted from this inhnite generating set, so TZAi^ is generated by 

Lfin = {po, Pi, ttq, vTi, decode 2 , contr, recontr, evR^^, reexpand, expand} U { 7 * : f E F}, 

where F is some hnite set of TT-AI^-programs. So for every 7 ^ there is a word in Pfi^ that expresses 
-jw as a hnite sequence of generators. Recall that Dom( 7 ^) = code(ri;) 11 Dom((/i^„), and for any 
X E Dom((/)^), 

7 .u,(code(r(;) 11 x) = code(r(;) 11 code(u) 11 u, 
where x = uv and u E domC((^^). 

The proof strategy will consist in showing that there are inhnitely many functions 7 ^ that do not 
have a correct representation over Pfjn. More precisely, for all TT.Al^-pi’ograms w and all u E domC((/it„), 
we have 7 i„(code(t(;) 11 u) = code(t(;) 11 code(u) 11; so 7 ^(code(tc) 11 u) E {00,01}* 11 {00,01}* 11. On 
the other hand, we will show that there exist (inhnitely many) 77 Al 2 "P^^ogi'fi™s w such that for every 
X E (Pfin)* that represents 7 ^, there exist (inhnitely many) u E domC((^^) such that: A(code(tc) 11 u) 
= code(t(;) 11 code(ui) 11 U 2 , where U 2 is non-empty; so, A(code(tc) 11 u) 0 { 00 , 01 }* 11 { 00 , 01 }* 11 . 
Thus we obtain a contradiction. 

We consider the 77 A 42 "P^og^^™s w such that domC((/)«,) satishes: 

( 1 ) no word in domC((/>i„) contains 11 as a subsegment; 

(2) for all i E F, domC((/>j) domC((?!)^); 

(3) for any u E domC((?!)^) and any integer n > 0, there exists v E domC(^^) of length |u| > n such 
that u = uqc, uq is a prehx of v, and |c| <4. Equivalently: 

(Vn E domC((^^)) (Vn > 0 ) E domC(0^), |u| > n) (3uo, c, z E A*) [v = u^z, u = uqc, |c| < 4]. 
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We can picture this as a path in the tree of A*, labeled by u and ending at vertex n; at vertex uq 
along this path, at distance < 4 from vertex u, a second path branches off and ends at vertex v (of 
length |t;| > n). 

The following family of examples shows that there exist infinitely many 7?.A4 2 -programs w that 
satisfy properties (l)-(3). In each of these examples (parameterized by a E {0,1}*) we have 

domC((?iu,) = {code(a”) 0010 : n > 0}, 

where a E {0,1}* is any fixed non-empty word (depending on w), chosen so that domC(i?iu,) / domC((/>j) 
for all i E -F (thus property (2) holds). Any word a that is long enough will work; indeed, for 
different words a the above prefix codes are different, whereas F is finite. Property (1) follows from 
the definition of code (namely, code(O) = 00, code(l) = 01). Property (3) holds because for every 
u = code(a™') 0010 and every n > 0, we can take uq = code(a™') and v = code(a"'+”*) 0010. The set 
{code(a”') 0010 : n > 0 } is a regular language, with regular expression (code(a))''' 0010 . 

Let X E Pfi,^ be a representation of 7^., where w is any FAd^-program from the family of examples 
above with properties (l)-(3). We will consider certain suffixes Si of X, over Pfjn. 

Let So be the shortest suffix of X such that for all u E domC((/)^), S'o(code(t(;) 11 tt) is of the form 
code(x) llcode(y) 11 z E {00,01}* 11(00,01}* 11(0,1}*. Then Sq exists since X itself (representing 
7^) maps code(t(;)lltt to code(t(;) 11 code(n) 11 E (00,01}* 11 (00,01}* 11 (0,1}*. 

Inductively we define Si, S 2 , ■ ■ ■, Si,..., where Si is the shortest suffix of X that has Si-i as a 
strict suffix, and such that for all u E domC((^^) we have: 

S'i(code(t(;) llu) E (00,01}* 11 (00,01}* 11 (0,1}*. 

So, S'i(code(t(;) lln) is of the form code(r(;i) 11 code(ui) 11 n 2 with wi,ui,U 2 E A*. Then X = Sn 
for some A" > 0 (and |A| > N). 

Theorem [Q now follows from the next Lemma, according to which there are (infinitely many) 
u E domC(())tu) such that SNicode{'w) lln) = code(n;) 11 code(ni) 11 n 2 , with U 2 non-empty. On 
the other hand, X = Sjy, and X represents jw, hence by the definition of 7 ^ we have for every 
u E domC( 0 i„): S'Ar(code(n;) lln) = code(n;) 11 code(n) 11; so U2 is empty. Thus, the assumption 
that X (over the finite generating set Tfjn) represents 7 ^, leads to a contradiction. 

Lemma 3.2 Let 7 ^ be such that domC((/)t„) = {code(a"’) 0010 : n > 0} for some word a E (0,1}*, 
chosen so that the program w satisfies properties (l)-(3). Let X he a word over Tfjn that represents 
7^, and let \X\ be the length of X overVim. Let So,...,Sm be the suffixes of X defined above, with 
Sn = X. Then there exist £ and n with £ > n > 0 such that for all i = 0,..., N and all u E domC(0i„) 
with |n| > £: 

Si{co6e{w) lln) = code(n;i) 11 code(ni) 11 n 2 , 

for some wi,ui,U 2 E A*. Moreover, U 2 has a non-empty common suffix with u, and this common 
suffix has length at least n. 

Proof. We have for all n E domC(())i„): S'i(code(n;) 11 n) = code(n;i) 11 code(ni) 11 n 2 , for some 
wi,ui,U2 E A*. We want to show that there is £ such that for all n E domC((/)^) with |n| > £: U2 has 
a non-empty (sufficiently long) suffix in common with n; the number n is an auxiliary parameter. We 
take n of the form n = code(a™') 0010 and use induction on i = 0,..., A. 

Proof for Sq: The only generators from Tfjn that can occur in Sq are tto, tti, po; Pi and jj (for j G F). 
Indeed, the other generators in Tfjn (namely decode 2 , contr, recontr, evR^^, reexpand, expand) are only 
applicable to inputs of the form code(x) 11 code(y) 11 z‘, so, Sq would end before a generator in {decode 2 , 
contr, recontr, evR^*^, reexpand, expand} can be applied. Moreover, Sq cannot start with a generator 
in {decode 2 , contr, recontr, evR^*", reexpand, expand}; indeed, for all inputs code{w) 11 u E domC(A), 
u = code(a™') 0010 contains no 11, so these generators are not defined on any element of domC(A). 
So, So is over {-ko,tii, Po, Pi] U (7^ : j E F}. 
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The actions of tto, vri,/O q,/J i can change an input in at most |S'o| positions at the left end of the 
input, so these actions preserve a common suffix U 2 and u of length > |u| — |S'o|. Thus, if Sq consists 
only of instances of ttq , tti , po > Pi ) the Lemma holds for So \u\ > i > n + |S'o| and n > 0 . 

Suppose now that Sq contains jj for some j G F. Then (if m > |5o|/|a|), instances of tto, tti, po) Pi 
will transform the input u = code(u;) 11 code(a™) 0010 into a word code(x)lls code(a^) 0010 (for 
some x,s G A*, k > 0), such that jj can be applied. This action changes an input in < |S'o| positions 
at the left end of the input. Since jj is assumed to be applicable now, we must also have x = j and 
s = uoz for some po £ domC((/>j), z G A*. Then the output of is 7 j(code(j) 11s code(a^) OOlO) 
= code(j) 11 code(po) H code(a^) 0010 , thus the common suffix of U 2 and u could decrease by length 
< IpoI under the action of 7 j. So we let £ > n + |5o| + |yo| and n > 0 . Also, at most one 7 ^ (with 
j G F) occurs in Sq, since after the output is of the form code(t(;i) 11 code(ui) 11 U 2 , which marks 
the end of the action of Sq. This proves the Lemma for Sq. 

Inductive step “5* —)■ S'i+i”, for 0 < i < A^: By induction we assume that for all u G domC((/)^^,) 
with IttI > £, we have 5i(code(tc) 11 tt) = code(rt;i) 11 code(ui) 11 n 2 for some rci, ui, U 2 G A*, where n 2 
and u have a common suffix of length > n (> 0). Let us write Si+i = Tj+iS’p then T^+i is non-empty 
(by the definition of Si+i). We also let Tq = Sq. 

Claim 1: If contains a generator g G {contr, recontr, evR^^, reexpand, expand, decode 2 }, then g 
is the first (i.e., rightmost) letter of Tj+i, and g occurs only once. 

Indeed, if g were applicable later in Tj+i, the output of the generator preceding g would be of the 
form code(t(;i) llcode(tti) 11 n 2 , so S'i+i would have ended before g was applied. 

Claim 2: If Tj-i-i contains a generator g G {contr, recontr, evR^^, reexpand, expand} U { 7 ^ • j G T}, 
then g is the last (i.e., leftmost) letter of Tj+i, and g occurs only once. 

Indeed, such a generator outputs a word of the form code(t(;i) 11 code(ui) 11 U 2 . So, S'i+i ends after 
such a generator. 

As a consequence of Claims 1 and 2, if Tj+i contains a generator g G {contr, recontr, evR^^ , 
reexpand, expand}, then Tj+i consists of just g. A generator of this form does not change U 2 . 

So we can assume for the remaining cases that Tj+i is of the form L+i, or tj+i •decode 2 , or 7 ^ -U+i, 
or 7 j • L+i • decode 2 , where j G F and L+i is over the generators tto, tti, po) Pi- 

Let code(t(;i) llcode(ui) 11 U 2 be the input of Tj+i (and this is also the output of Si), where U 2 
and u have a common suffix of length > n. 

• Case where Tj+i is over the generators 7Tq,7Ti, pQ, pi: Then Tj+i changes the input in at most |Tj+i| 
positions at the left end of the input, so U 2 will not be affected if i — n> |Tj+i| (and n > 0 ). 

• Case where Tj+i = L+i • decode 2 , with over ttqjTTi, pQ, pi: The output of decode 2 is of the form 
code(u;i) 11 ui U 2 , so the common suffix of U 2 and u is preserved by decode 2 . The action of L+i, 
containing only generators from {'Kq,tti, pQ, pi}, affects at most |L+i| positions near the left side of the 
input, so U 2 is not changed if £ — n> |Tj+i| (and n > 0 ). 

• Case where Tj+i = 7 ^- • L+i, with tj+i over ttqjTTi, pQ, pp. Applications of tto, tti, po> Pi change fewer 
than |L+i| letters of the input near the left end, so the common suffix is not affected if i — n> |Ti_|_i|. 
When 7 j is applied, the output produced will be of the form code(}) 11 code(pj+i) 11 z code(a"') 0010, 
where pi+i G domC(())j). Then U 2 will not be affected if we pick £ > n + \Ti+i\ + \yi+i\ and n > 0. 

• Case where Tj+i = 7 ^ • L+i • decode 2 , with L+i over tto,tti, po,pi: This case can be handled as a 
combination of the previous two cases. 

In all the above cases the constraints are fulfilled for alH = 0,... , A^, and for all u = code(a™) 0010, if 
m > Ai + |A| +J2i^o ly*l (using the fact that 1^*1 = 1^1 )• Note that the words yi do not depend 

on the choice of the input u = code(a”^) 0010, whenever m is long enough; indeed, to determine all yi 
we can apply each Si to the infinite word code(a)‘^ G {0,1}“^. □ 
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Notation. For a given polynomial q (of the form q{n) = an^ + a with integers a,k > 1), let 

•^ 2 ^^ — if ^ 7 ?.A ^2 • / is computed by an 7 ^A^ 2 "Program with built-in polynomial < q}. 

We call w an -program iff w is an TZM^-piogisim with built-in balance and time-complexity 
polynomial < q. 

Let i.e., the submonoid of TZM^ generated by the set Obviously, we have: 

Proposition 3.3 For any set of polynomials {qi : i £ N} of the form qi{n) = -|- at, such that 

sup{aj : f G N} = -|-oo = supjfei : i G N}, we have: UieN 

The non-finite generation result for TZAi^ also holds for TZAi^^^ proof is similar. We need a 

few preliminary facts. 

Lemma 3.4 For every polynomial q of the form q{n) = an^+a with a,k >2, and every -program 
w we have: 'jw G 5^'^^. 

Proof. Recall that 7 ^(code(tc) 11 uu) = code(t(;) 11 code(tt) 11 u, where u G domC(i?!)^„). The input 
balance of 7 ^ is < q. Indeed, the input is shorter than the output; and the output length is 2|tc|-|-2-|- 
2 |m| -|- 2 -|- |u|, which is less than g(|code(r(;) 11 uu|) = q{2 |u;| -|- 2 -|-|u|-|- 2 -|-|u|) when q{n) >2n‘^ + 2. 

To compute code(r(;) llcode(u) 11 u from input code(rc) Iluv, an 7^7V4 2 -machine can proceed as 
follows: First, the machine reads and outputs code(t(;) 11. Then it runs the program w on input 
uv, i.e., it simulates the corresponding TZAi^-machiiie (which has built-in polynomial q), with 
an extra tape and a few modifications. While searching for a prefix of uv in domC((/>i„), the longest 
prefix examined so far is kept on the extra tape; the output 4>w{u) of will not be written on the 
output tape. Once u (the prefix of uv in domC((/)^)) has been found (and written on the extra tape), 
code(u) 11 u is appended on the output tape. 

All this takes time < |code(rc) 11| -|- q{\u\) -|- |code(n) 11 u| = 2\w\+2-\- ( 7 (|n|) -|- 2 |n| -|- 2 -|- |u|; this 
is < (?(2 |t(;|-b 2-|-|u|-b 2-|-|u|) = ( 7 (|code(rc) 11 uu|) when q{n)>2n^ + 2. □ 

Lemma 3.5 Let q be a polynomial that is larger than a certain polynomial of degree 5. Then TZAi^^ 
is generated by 

{po, pi, TTo, vTi, decode 2 , contr, recontr, evR^*", reexpand, expand} U { 7 ^ : 2 : is an ^ 2 '^^-program}. 

Proof. When w is an iS^'^^-program then as a consequence of Lemma l2.ll 

4>w = /Ocode(io') 11 ° decode 2 o contr o recontr^™ o o reexpand™ o expand o 7 ^, 

= Pcode{w') 11 ° decode 2 o contr o recontr^"* o evR^^ o reexpand"* o expand o TT^ode{w) 11 ) 

where q 2 is a certain polynomial of degree 2 . So the above generating set does indeed generate TZAi^^'^. 
We still need to show that these generators belong to TZAi^^'^. 

The functions pQ, pi, decode 2 , contr, recontr, reexpand, expand, ttq, tti have balance and complexity 
< 4 (n -b 1)^. And 7 ^ G TZAi^^ if w is an iS^'^^-program (by Lemma [3.411 . Let us verify that evR^^ has 
balance < q 2 and complexity O(n^). By definition, 

evR^^(code(t(;) 11 code(u) 11 v) = code{w) 11 code{(f>w{u)) 11 v. 

Then evR^^ has balance < q 2 , since on an output of length n = 2\w\ + 2 + 2\4>^{u)\ -b 2, the input 
length is < 2 |u;| -b 2 -b 2 q 2 {\ 4 'w{u)\) + 2 < ^' 2(2 |tc| -b 2 -b 2 |(/>«;(u)| -b 2) = q 2 in). 

When (j)^ can be computed by an -machine with built-in polynomial pu, (< Q' 2 ), then 

evR^*^(code(i(;) 11 code(tt) 11 ) can be computed in time < c |tc| Pwi\u\)'^ < c |t(;| q 2 {\u\)‘^, for some 
constant c > 0 (see the proof of Prop. 4.4 in [ 2 ]). Since q 2 has degree 2 , evR^^ has complexity Ofnf’). 
Thus, there exists q of degree 5 such that the above generators belong to TZAi^^'^. □ 
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Theorem 3.6 For any polynomial q such that q{n) = an^ + a, with k > 5 and a > oq (for some 
constant oq > 1), we have: is not finitely generated. 

Proof. The proof is very similar to the proof of Theorem 13.II We saw in Lemma 13.51 that is 

generated by the infinite set 

{POj Pi, ttO) decode 2 , contr, recontr, evR^*^, reexpand, expand} U {72 : z is an -program}. 

Let us assume, by contradiction, that is finitely generated. Then a finite generating set can 

be extracted from this infinite generating set; so TZ is generated by 

bfin = {po, Pi, TTo, tti, decode 2 , contr, recontr, evR^*", reexpand, expand} U { 7 * : i G F}, 

where F is some finite set of iS^'^^-programs. For every ^^'^^-program w let X be a word in Tfi^^ that 
expresses 7 ^ as a finite sequence of generators. 

From here on, the proof is identical to the proof of Theorem 13.II We use the fact that domC((/>^) = 
{code(a’^) 0010 : n > 0 } is a finite-state language, so for such a program w, has linear complexity 
(being computable by a Mealy machine) and belongs to S^\ □ 

4 Some complexity consequences of non-finite generation 

4.1 Hierarchy and separation 

Proposition 4.1 Let q be a polynomial of the form q{n) = a + a such that a,k > 1. The set 
and hence the monoid TZA4^\ are contained in a finitely generated submonoid ofTZAi^. 

Proof. Let rc be a T^Al^’Program such that has I/O-balance and time-complexity < q. Then 
evR^ can simulate (fw directly, without any need of padding and unpadding. So we have for all 
u G domC((?iu,), v G A*-. 

° evR^o 

So 52 '^^ is contained in the submonoid generated by {tto, vri,poiPi,evR^}. (Compare with Lemma 
and the proof of Prop. 4.5 in [2].) □ 

The proof of Prop. 14.11 yields the following chain of sub monoids in which non-finitely generated and 
finitely generated submonoids alternate. 

Corollary 4.2 Let ... < qi < qi+i < ... be any sequence of polynomials such that for all i > 0, 
qi^i is is large enough so that evR^ has an TZAi^-program with built-in polynomial Qi+i. Then TZAi^ 
contains a strict inclusion chain, which is infinite in the upward direction, 

... C C (7ro,7ri,po,Pi, evR^)^^P g g . 

Proof. The strictness of the inclusions in the chain follows from the fact that non-finite generation 
and finite generation alternate. □ 

Theorem 4.3 Let q he a polynomial of the form q{n) = an^ + a such that a > l,k > 1. The 
submonoid LZAi^^'^ C LZAi^ has the following properties: 

(1) TZAi^^^ fiTZAil- 

(2) If q{n) >2{n + 1)^ (for all n G Nj, then IZAi^^'^ contains elements of arbitrarily high polynomial 
balance and time-complexity. 

(3) 5^''^ IZA\^^^, ifq{n) >2{n + lf. 
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(4-) evR^ 0 TZM .^2 \ if k > 5 and a > ao (where oq is as in Theorem \3. 6\) . 

Moreover, evR^ has balance < q, but its time-complexity is not < q. 

(5) Let 91,^2 be polynomials of the above form, such that qi{n) < q 2 {n) for all n € N. Suppose 
also that qi{n) = on^ + a with k > 5 and a > oq (as in (4)), o,nd that q 2 is large enough so that 
evR^^ E Then g 

Proof. (1) Since is contained in a finitely generated submonoid of TZM^ (Prop. 14.ip . and 

TZM.^ is not contained (by C) in a finitely generated submonoid of TZM.^ (itself), inequality follows. 

(2) Consider the function s : 0"'lx e->■ for all n > 0, x E {0,1}*. Then s has time-complexity 

< 2 (n + 1)^. Indeed, a Turing machine on input O”! can read this word n times, each time turning 

2 

an input 0 into some new letter a, and each time writing O"" on the output tape; this produces O"" in 
the output; then one more copy of 0"’ is made, followed by 1. This takes time < 2 (n -|- 1)^. 

Then, s™ (i.e., the composition of m instances of s) has complexity > 2”* (since the output 
length is that high, the time must be at least that much too). Thus the functions s"* E (as m 

grows) have unbounded complexity, both in their degree and in their coefficient. 

(3) By (2), contains functions with arbitrarily high polynomial balance and time-complexity, 

whereas only contains functions with balance and complexity < q. 

(4) By Prop. 14. 1( is contained in the submonoid generated by {tto, tti, poj Pi, evR^}; and we 

easily see that tto, tti, po, Pi £ ■ Hence, if evR^ belonged to TZAi ^2 \ the monoid TZAi^^'^ would 

be finitely generated, contradicting Theorem 13.61 

The input balance of evR^ is < g (see Lemma [T5l where this is proved for 52 )- It follows that the 

time-complexity of evR^ is not < q, otherwise we would have evR^ E TZAi^'^ . 

(5) This follows from (4) since evR^ 0 TZAi^ 2 ^\ but evR^ E TZAi^ 2 ^\ □ 

Corollary 4.4 (Strict complexity hierarchy of submonoids in TZAi^). There exists an infinite 
sequence of polynomials {qi : i ^ N), each of the form qi{n) = Oin^* -|- ai with ki,ai > 1, and with 
qi{n) < gj+i(n) for all f,n E N, such that the following holds: 

TZAi^^^^ C for all i, and = ^-^2 • 

Moreover, TZAi^ (which is not finitely generated) is the union of a d-chain of 4-generated submonoids. 

Proof. The first statements follow from Theorem 14.31 fl) and Prop. [331 The last statement follows 
from Cor. 14.21 □ 

Since each TZAi^ 2 '^^ contains functions of arbitrarily high polynomial complexity (by Theorem 14.31 
( 2 )), the monoids TZAi^^''^ form a strict complexity hierarchy of a new sort, different from the usual 
complexity hierarchies. The fact that 7 ^ TZAi^ could have been shown by a diagonal argument. 
It is not clear whether classical separation techniques from complexity theory would show the results 
(1), (3), (4), (5) of Theorem 14.31 

Remark: The monoid fP, being finitely generated, does not contain an infinite strict complexity 
hierarchy of monoids (but it can contain hierarchies of sets). Indeed, we have in general: 

Fact. A finitely generated monoid M does not contain any infinite strict u-chain of submonoids whose 
union is M. 

Indeed, if we had a chain (M* : i E w) with Mq ^ .. . ^ Mi ^ Mj+i ^ . ^ Mi = M, then 

there would exist j such that Mj contains a finite set of generators of M (since Mi = M). Then 
Mj = AI, contradicting the strict hierarchy. 
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This Fact does not hold for chains over arbitrary order types; it holds for limit ordinals. The 
non-finitely generated monoid TZAi^ contains the encoding fP^ as a submonoid (see Section 3 in [2]). 
And fP^ is finitely generated (being an isomorphic copy of fP), and fP*^ contains an isomorphic copy 
of TZAi^. This leads to non-o; strict chains of submonoids of fP and of TZAi^- 

4.2 Irreducible functions 

Another consequence of non-finite generation is that TZAi^ and TZAi^'^ have “irreducible” elements, 
i.e., elements that cannot be expressed by composition of lower-complexity elements. We make this 
precise in the next definitions. 

In this subsection we do not use evaluation maps, so we can use “polynomials” q{n) = an^ + a 
where we drop the requirement that a, k are integers, i.e., we now allow real numbers > 1. 

Definition 4.5 The inf complexity degree o/ / E TZAi^ is 

df = inf{A: € M>i : / € for some polynomial q of the form q{n) = + b, 

for some 6 > 1 }. 

We also define the inf complexity coefficient Cf of f by 
Cf = inf{C'/(e) : e E K>o}, where 

Cf{e) = inf{a E M>i : / E for some polynomial q of the form q{n) = + a}. 

The inf complexity polynomial of f is the polynomial qj given by qf{n) = cj ■ + 1) (for a// n E N ). 

Since df and cj are defined by infimum, / might not be in By the definition of inf we have the 

following. 

Proposition 4.6 For any polynomial q{n) = an^ + a with k > df and a > cf. f E TZA 4 ^ 2 ^ . □ 

On the other hand, for every ei > 0, £2 > 0: 

/ 0 52^^^ for any polynomial pi(n) = bn'^f~^^ + b with any 6 > 1; 

/ 0 82 ^^^ where p 2 {n) = {cf — £ 2 ) ■ + 1). 

Definition 4.7 Let us choose 61,62 E M>o. A function f E TZAi^ is called (di, (l 2 )-reducible iff 
f E TZA 4 ^ 2 ^ for some polynomial q{n) = (c/ — 62 ) ■ -|- 1). And f is called (di, (52)-irreducible iff 

f is not { 61 , 62 )-reducible. 

In other words, / is (di, (52)-reducible iff / is a composite of elements of i.e., / E 7ZAi^\ where 
q{n) = {cf — 62 ) ■ 1). So, / can be factored into functions that “have strictly lower complexity 

than /” (regarding both the degree and the coefficient). Note that in the definition of df and Cf we 
used ^ 2 '^^ not TZ (Def. 14.5h . 

Proposition 4.8 For all 61,62 E M>o and all polynomials qi,q 2 such that TZAi^ 2 ^'^ ^ TZAi^ 2 ^\ there 
exist (di, (52)-irreducible functions in TZAi^^"^ —TZAd^^K 

Proof. By contradiction, assume that there exist 61,62 such that every / E 7 ZAi^ 2 ^'^ — TZAi^^^ is 
(di, (52)-reducible, i.e., / can be factored as f = fm,o ■ ■ ■ 0 / 1 , where fi E TZAA^ 2 ^^ (i = 1,... ,m) with 
inf degree dj. < df — 6 i and inf coefficient cy. < Cf — 62 . By the contradiction assumption, among these 
factors, those that are in TZAd'^^^ — TZAi^^^ can themselves be factored into elements of degree and 
coefficient lower by amount di, respectively 62 - I.e., a factor fi E TZAi^ 2 ^^ —TZAd^^^^ can be factored as 
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fi = fi,mi o ... fi,i with < df. - (5i and . < Cf^ - 62 ; hence, df^ . <df-26i and Cf^ . < c/ - 2 <52, 
for j = 1,... ,mi. By repeating this process we keep reducing the degree and the coefficient by at 
least (^ 1 , respectively 82 , in each step. After a finite number of steps we obtain a factorization of / into 
functions in contradicting the assumption that ^ □ 

Remark: A finitely generated monoid, like fP, does not contain irreducible functions of arbitrarily 
large complexity. Indeed, all elements are expressible as a composite of elements of bounded complexity 
(namely the maximum complexity of the finitely many generators). 
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